Inventi Impact: Signal Processing

Articles

Inventi:esp/21293/16

Joint Training of DNNs by Incorporating an Explicit Dereverberation Structure for Distant Speech Recognition

Research 2017 : January - March

Tian Gao, Jun Du, Yong Xu, Cong Liu, Li-Rong Dai, Chin-Hui Lee

We explore joint training strategies of DNNs for simultaneous dereverberation and acoustic modeling to improve the performance of distant speech recognition. There are two key contributions. First, a new DNN structure incorporating both dereverberated and original reverberant features is shown to improve recognition accuracy over the conventional one that uses only dereverberated features as the input. Second, in most simulated reverberant environments for training data collection and DNN-based dereverberation, the resource data and learning targets are high-quality clean speech. With our joint training strategy, we can relax this constraint by using large-scale, diversified real close-talking data as the targets, which are easy to collect from mobile internet users via many speech-enabled applications, and we find this scenario even more effective. Our experiments on a Mandarin speech recognition task with 2000 h of training data show that the proposed framework achieves relative word error rate reductions of 9.7% and 8.6% over the multi-condition training systems for the single-channel case and the multi-channel case with beamforming, respectively. Furthermore, significant gains are consistently observed over the pre-processing approach that simply uses DNN-based dereverberation.
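The first contribution, feeding the acoustic model both the dereverberated and the original reverberant features, can be illustrated with a minimal forward-pass sketch. This is not the authors' implementation: the layer sizes, random weights, sigmoid activations, and the names `dense`, `init_layer`, and `joint_forward` are all assumptions chosen for illustration, and training (where the recognition loss would be back-propagated through both stages) is not shown.

```python
import math
import random


def dense(x, weights, bias, activation=None):
    """One fully connected layer: activation(W x + b)."""
    out = [sum(w * xi for w, xi in zip(row, x)) + b
           for row, b in zip(weights, bias)]
    if activation == "sigmoid":
        out = [1.0 / (1.0 + math.exp(-v)) for v in out]
    return out


def init_layer(n_in, n_out, rng):
    """Small random weights and zero biases (illustrative initialization)."""
    weights = [[rng.uniform(-0.1, 0.1) for _ in range(n_in)]
               for _ in range(n_out)]
    return weights, [0.0] * n_out


def softmax(z):
    """Numerically stable softmax over a list of scores."""
    m = max(z)
    e = [math.exp(v - m) for v in z]
    s = sum(e)
    return [v / s for v in e]


def joint_forward(reverb, params):
    """Dereverberation stage followed by an acoustic model that sees
    both the dereverberated estimate and the original reverberant input."""
    # Stage 1: regress an estimate of the close-talking (target) features.
    h = dense(reverb, *params["derev_hidden"], activation="sigmoid")
    derev = dense(h, *params["derev_out"])  # linear regression output

    # Stage 2: concatenate dereverberated and reverberant features.
    am_input = derev + reverb  # list concatenation, not element-wise sum
    h = dense(am_input, *params["am_hidden"], activation="sigmoid")
    posteriors = softmax(dense(h, *params["am_out"]))
    return derev, posteriors


# Hypothetical toy dimensions; real systems use far larger layers.
FEAT_DIM, HIDDEN, N_STATES = 8, 16, 5
rng = random.Random(0)
params = {
    "derev_hidden": init_layer(FEAT_DIM, HIDDEN, rng),
    "derev_out": init_layer(HIDDEN, FEAT_DIM, rng),
    "am_hidden": init_layer(2 * FEAT_DIM, HIDDEN, rng),
    "am_out": init_layer(HIDDEN, N_STATES, rng),
}

reverb_frame = [rng.gauss(0.0, 1.0) for _ in range(FEAT_DIM)]
derev_frame, posteriors = joint_forward(reverb_frame, params)
```

In this sketch the acoustic-model input is twice the feature dimension because it carries both views of the frame; the conventional pipeline described in the abstract would pass only `derev` to the second stage.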

How to Cite this Article
CC Compliant Citation: Gao, Tian, et al. "Joint training of DNNs by incorporating an explicit dereverberation structure for distant speech recognition." EURASIP Journal on Advances in Signal Processing 2016.1 (2016): 86, DOI 10.1186/s13634-016-0384-5, https://creativecommons.org/licenses/by/4.0/.
